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ABSTRACT 

In this paper, the problem of false information injection attack 
and defense on state estimation in dynamic multi-sensor sys¬ 
tems is investigated from a game theoretic perspective. The 
relationship between the Kalman hlter and the adversary can 
be regarded as a two-person zero-sum game. Under which 
condition both sides of the game will reach the Nash equilib¬ 
rium is investigated in the paper. The multi-sensor Kalman 
hlter system and the adversary are supposed to be rational 
players. The Kalman hlter and the adversary have to choose 
their respective subsets of sensors to perform system state es¬ 
timation and false information injection. It is shown how both 
sides pick their strategies in order to gain more and lose less. 
The optimal solutions are achieved by solving the minimax 
problem. Numerical results are also provided in order to il¬ 
lustrate the effectiveness of the derived optimal strategies. 

Index Terms — Game theory, malicious attack, state esti¬ 
mation, Kalman hlter, multi-sensor systems 

1. INTRODUCTION 

System state estimation in the presence of an adversary that 
injects false information into sensor readings is an important 
problem with wide application areas, such as target track¬ 
ing with compromised sensors, secure monitoring of dynamic 
electric power systems and radar tracking and detection in the 
presence of jammers. This topic has attracted considerable 
attention and interest recently m EE] a Sin Cl. In Q, the 
problem of how to take advantage of the power system con- 
hguration to introduce arbitrary bias to the system was inves¬ 
tigated. In lE, the authors showed the impact of malicious 
attacks on real-time electricity market and how the attackers 
can make proht by manipulating certain values of the mea¬ 
surements. The relationship between the attackers and the 
control center was discussed in 0, where both the adver¬ 
sary’s attack strategies and the control center’s attack detec¬ 
tion algorithms have been proposed. False data attacks on the 
electricity market have also been investigated in ii and 0. 
In El, the data frame attack was formulated as a quadratically 


constrained quadratic program (QCQP). In Q, the relation¬ 
ship between a target and a MIMO radar was characterized 
as a two-person zero-sum game. However, in the aforemen¬ 
tioned publications, only the problem of static system state 
estimation has been considered. 

We are interested in dynamic system state estimation and 
in 0, we have studied the impact of the injected biases on 
a Kalman hlter (KF)’s estimation performance, showing that 
if the false information is injected at a single time, its impact 
converges to zero as time goes on; if the false information 
is injected into the system continuously, the estimation er¬ 
ror tends to reach a steady state. In 0, we have found the 
best strategy for the adversary to attack the Kalman hlter sys¬ 
tem from the perspective of the trace of the mean squared 
error (MSB) matrix, and obtained some close-form results. 
We have also studied how the attacker can maximize the de¬ 
terminant of the Kalman hlter’s estimation MSB matrix in 
cni. Based on our previous work, in this paper our goal is 
to use game theory to investigate the relationship between the 
Kalman hlter and the attacker. The Kalman hlter (the de¬ 
fender) and the attacker are supposed to be rational players. 
The trace of the state estimation MSB is used to construct 
the payoff matrix, and the problem can be characterized and 
solved as a minimax problem. Numerical results show the ef¬ 
fectiveness of the optimal mixed defense strategy for the KB 
against the adversary’s attacks. 

2. SYSTEM MODEL 

The discrete-time linear dynamic system can be described as 

Xfc+i = FfeXfc-f GfcUfc-f Vfc (1) 

where is the system state transition matrix, x^ is the sys¬ 
tem state vector at time k, is a known input vector, Gfc is 
the input gain matrix, and is a zero-mean white Gaussian 
process noise with covariance matrix E[vkvJ] = Q^. Bet 
us assume that M sensors are used by the linear system. The 
measurement at time k collected by sensor i is 


( 2 ) 


with Hfc i being the measurement matrix, and ^ a zero- 
mean white Gaussian measurement noise with covariance ma¬ 
trix for i = 1, • • ■ ,M. We further 

assume that the measurement noises are independent across 
sensors. The matrices F^, G^, i, Qfc, and are as¬ 
sumed to be known with proper dimensions. For such a linear 
and Gaussian dynamic system, the Kalman hlter is the opti¬ 
mal state estimator. In this paper, we assume that a bias bfc ^ 
is injected by the adversary into the measurement of the ith 
sensor at time k intentionally. Therefore, the measurement 
equation becomes 

Zfc j -f vfk,i + hk,i = Zk,i + (3) 

where zj. ^ is the corrupted measurement, ^ is either an un¬ 
known constant or a random variable independent of {v^ ^} 
and {wfe^i}. 


3. IMPACT OF FALSE INFORMATION INJECTION 

Let us first assume that the adversary attacks the system by 
injecting false information into the sensors while the Kalman 
filter is unaware of such attacks. We start with the case where 
biases (bfc) are continuously injected into the system starting 
from a certain time K. Note that single injection is just a 
special case of continuous injection when b^ are set to be 
nonzero at time K and zero otherwise. 

In the continuous injection case, the Kalman hlter’ extra 
mean square error (EMSE), which is caused by the continu¬ 
ous bias injection alone, is derived in HD and provided as 
follows. 

Proposition I. When the bias sequence {bfc} is zero mean, 
random, and independent over time, the EMSE at time 
K + N due to the biases injected at and after time K, de¬ 
noted as Ak+N, is Ak+N = 

where Dm = Wic+iv-m, and Bk = 

(I - Fif_i. = 1 is an identity 

matrix, 'W k is the Kalman filter gain 1 1721? . and I^K-i-N-m is 
the covariance matrix ofhx+N-m- 

InS, we investigated the optimal attack strategy that an 
adversary can adopt to maximize the system estimator’s esti¬ 
mation error. The problem can be formulated as a constrained 
optimization problem. Without loss of generality, let us as¬ 
sume that the attacker is interested in maximizing the system 
state estimation error at time K right after a single false bias is 
injected at time K. In this case, we are interested in designing 
the injected random bias’ covariance matrix such that 

maxTr [Pk\k + AkC^k)] 

s.t. Tr(Sif) = (4) 


hlter’s state covariance matrix at time K in the absence of 
any false information. Eor both the cases where the attacker 
injects independent noises and dependent noises to position- 
only sensors in an object tracking system, we have derived 
the optimal strategies to maximize the trace of the state esti¬ 
mation MSE matrix as provided in the following two propo¬ 
sitions 0. 

Proposition 2. For a system with M sensors, if the adver¬ 
sary injects independent random noises, the best strategy is 
to allocate all the power to the sensor with the smallest mea¬ 
surement noise variance. 

Proposition 3. For a system with M sensors, the optimal 
strategy for the adversary is to inject dependent random 
noises with a pairwise correlation coefficient of 1. The 
noise power is allocated such that at,. = . , i £ 

V 52j=i 

(1, • • • , M}, where ab^ is the standard deviation (s.d.) of the 
noise injected to the ith sensor, c, = - [wt -^ 

E". (i/<) 

CTwi is the ith position-only sensor’s measurement noise s.d. 


4. A TARGET TRACKING EXAMPLE 


In this paper, we give a concrete target tracking example, and 
assume that the target moves in a one-dimensional space ac¬ 
cording to a discrete white noise acceleration model ifT^ . 
which can still be described by the plant and measurement 
equations provided in ([D and (|2]i. In such a system, the state 
is defined as Xk = [^fc ^fc]^, where and ^fc denote the tar¬ 
get’s position and velocity at time k respectively. The input 
Ufc is a zero sequence. The state transition matrix is 


Ffc 


1 A 
0 1 


Vfc 


(5) 


where A is the sensor sampling interval. The process noise is 
Vfc = Fufc, where Vk is a zero mean white acceleration noise, 
with variance cr^, and the vector gain multiplying the scalar 
process noise is given by F = [A^/2 A] The covariance 
matrix of the process noise is therefore Q = alTT'^. The 
observation matrix in {2} is given as 


Hfc,, = [1 0] Vfc, i (6) 


Once the system model is known, it is straightforward for both 
the Kalman hlter and the adversary to calculate the Kalman 
hlter’s state covariance matrix P^iic as in lfT2l . Using Prepo¬ 
sition 1, we can obtain the trace of the total state estimation 
MSE matrix: 


where a is a constraint on the power of the injected noise, ^ 

Tr(-) is the matrix trace operator, and Pk\k is the Kalman Tr(MSE) = TxiV k\k + W/c(7) 







5. NONCOOPERATIVE TWO-PERSON ZERO-SUM 
GAME 

In a noncooperative two-person zero-sum game ifTSl . we as¬ 
sume that there are two players, referred to as Players 1 and 
2, and an to x n payoff matrix L = {kj}. Each entry of the 
matrix is an outcome of the game corresponding to a partic¬ 
ular pair of decisions made by both players. Player 1 gets to 
rows of the matrix as his/her strategy set, while for Player 2, 
the strategy set is the corresponding n columns of the same 
matrix. 

In our problem, suppose there are totally M sensors, the 
Kalman filter and the adversary can choose any non-empty 
subsets of sensors to perform state estimation and attack re¬ 
spectively, which means m = n = 2 ^ — 1 . Lisa square 
matrix of the size {2^ — 1) x {2^ — 1). The payoff in the 
game between the Kalman filter system and the adversary will 
be the trace of the state estimation MSE matrix. Eor each set 
of sensors he/she chooses to attack, the adversary is under a 
total injected noise power constraint as specified in (HJi. The 
Nash equilibrium between the Kalman filter and the adversary 
is achieved by solving the minimax optimization problem. 

Let {row i, column be a pair of strategies adopted by 
the players, and the corresponding outcome (payoff) be lij, 
which means that Player 1 should pay Player 2 the amount 
of lij. If li*j < li*j<- < lij’^, for alH = 1,... ,to and all 
j = 1 ,..., n, the pair {**, j*} is said to constitute a saddle- 
point equilibrium, and the game is said to have a saddle point 
in pure strategy. On the other hand, if the pair of inequalities 
does not exist, one can derive the mixed strategy to obtain the 
equilibrium. A mixed strategy is a probability distribution on 
the space of the player’s pure strategies. A mixed strategy 
allows for a player to select a pure strategy randomly with 
a certain probability. In this case, the utility function u is 
defined as 

m n 

y) = X! X! (8) 

i=l j=l 

where x and y are the probability distribution vectors for the 
mixed strategies. Also, x G X, y G Y, where the set X = 
(x G i?"* : X > 0, — 1}’ ^ defined in 

the same way. The Kalman filter playing as defender is trying 
to minimize the utility function u(x, y) by choosing the best 
defending strategy, while the attacker wants to maximize the 
utility function by choosing the best attack strategy. Eor the 
payoff matrix L of size to x n, a vector of x* is the best mixed 
strategy for the Kalman filter if 

{7m(L) = max(x*)^Ly < maxx'^Ly,x G X (9) 
yex yex 

The C/m(L) is known as the average security level (loss ceil¬ 
ing) of the defender, the average security level (gain-floor) of 
the attacker can also be defined as below, 

[/^(L) = minx^Ly* > minx’^Ly,y G Y (10) 


It always holds that {/^(L) = LLmC^) for mixed strate¬ 
gies in noncooperative two-person zero-sum game. The 
saddle point in the mixed strategies is defined when the two 
bounds are equal to each other, which can be found by solving 
the following linear programming problem US: 

min (11) 

xGX 

s.t. L^x < &„1 

x^l = l 
X > 0 

where bu denotes a constant upper bound. Eor the attacker, 
the formula is the other way around, 

max bi (12) 

yex 

s.t. Ly > 6/1 

y^i = i 

y>0 

where 6/ denotes a constant lower bound. Erom the formula¬ 
tion above, it is easy to see that (fT2l) is the dual form of the 
optimization problem (fTTI) . The optimal function for the two 
problems are the same. Interested readers are referred to Ol 
for more details. 


6. NUMERICAL RESULTS 


In the example, for simplicity and ease of presentation, we 
assume that there are three sensors denoted as {21,^2,23} 
in the system having independent measurement noises with 
noise standard deviations = 3, 0-^,2 = 4, = 5. The 

system process noise s.d. is cr„ = 0.5, sensors’s sampling in¬ 
terval is A = Is, and the system initial state Xq is assumed 
to follow a Af{xo\o, Po|o) distribution, where a:o|o = [1 1]^ 
and 


P o|o — 


0.25 0.25 
0.25 0.5 


The adversary can choose any combination of sensors 
from the set Pi = {zi, Z 2 , Z 3 , Z 1 Z 2 , Z 1 Z 3 , Z 2 Z 3 , Z 1 Z 2 Z 3 } to 
attack with the power constraint of = 100, where 

abi is the s.d. of the random noise injected to Sensor i. Like¬ 
wise, the defender can choose any combination of sensors 
to perform state estimation, and its strategy set is the same: 
P 2 = Pi- The game is played as below: if the defender uses 
data from Sensors i and j for state estimation, while the ad¬ 
versary attacks Sensors i and k, then system state estimation 
is affected by the false information from the ith sensor only. 

In this game, the trace of the state estimation MSE ma¬ 
trix is regarded as the payoff of the game. In the games of 
the independent and dependent attacks, the system is attacked 
according to the strategies provided in Propositions |2] and [3 




Table 1. Payoff Matrix (Independent Case) 


KF/At 

Zl 

2:2 

23 

^1^2 

ZIZ3 

2223 

21^223 

Zl 

25.4 

4.7 

4.7 

25.4 

25.4 

A.l 

25.4 

Z2 

7.2 

23.5 

7.2 

7.2 

7.2 

23.5 

7.2 

Z 3 

10 

10 

23.6 

10 

10 

10 

10 

ZlZ2 

13.5 

6.6 

3.4 

13.5 

13.5 

6.6 

13.5 

Z1Z3 

16.4 

3.8 

5.4 

16.4 

16.4 

3.8 

16.4 

Z2Z3 

5.0 

12.4 

8.0 

5.0 

5.0 

12.4 

5.0 

Z1Z2Z3 

10.2 

5.2 

3.9 

10.2 

10.2 

5.2 

10.2 


Table 2. Payoff Matrix (Dependent Case) 


KF/At 

21 

22 

23 

212:2 

2123 

2223 

212223 

21 

25.4 

4.7 

4.7 

13.2 

15.9 

4.7 

10.3 

22 

7.2 

23.5 

7.2 

9.3 

7.2 

13.3 

8.6 

23 

10 

10 

23.6 

10 

11.0 

12.1 

10.5 

212:2 

13.5 

6.6 

3.4 

16.7 

12.4 

5.6 

15.6 

2123 

16.4 

3.8 

5.4 

15.0 

18.1 

4.2 

15.0 

2223 

5.0 

12.4 

8.0 

6.8 

5.3 

15.5 

8.2 

21222:3 

10.2 

5.2 

3.9 

12.5 

11.1 

6.2 

13.4 


respectively. Let us assume that the adversary attacks the sen¬ 
sors at time k = 100, and the payoff matrix is given in Ta¬ 
bles [U and |2] From Tables [T] and |2l we can see that there 
is no pure strategy Nash Equilibrium. Instead, we use mixed 
strategies to find the Nash Equilibrium. In order to obtain the 
optimal probability distribution vector, we solve the optimiza¬ 
tion problem formulated in (fTTl i. The solution to (fTTI) is the 
optimal probability vector for the defender, and the dual so¬ 
lution is the optimal mixed strategy for the attacker. The op¬ 
timal solutions for independent- and dependent-attack cases 
are shown in Tables |3and|4]respectively. 

Eor the independent case, we can see from Table [T] that 
( 6 , 6 ) and (7,7) elements of the payoff matrix (L) are the 
smallest among the seven diagonal elements. This means that 
in the worst cases for the KE when its chosen sensor com¬ 
bination happens to be the same as that being attacked by 
the adversary, the strategies Z 2 Z 3 and Z 1 Z 2 Z 3 will lead to the 
smallest state estimation MSEs. In addition, for the KE, the 
values of last two rows are relatively small. As a result, for 
the KE, the probabilities of the last two strategies ( 2 : 22:3 and 
Z 1 Z 2 Z 3 ) are much larger than those of other strategies, which 
are shown in Table |3] 

In the dependent case, for the KE, the probabilities for the 
last two pure strategies ( 2 : 22:3 and Z 1 Z 2 Z 3 ) are relatively large 
as shown in Table |4] This can be explained similarly as in 
the independent case. In L, the entries of the rows corre¬ 
sponding to Z 3 , Z 1 Z 2 , and 2 : 12:3 are relatively large, so the KE 
assigns nearly zero probabilities to these three strategies. In 
the first two rows of L, even though the diagonal elements are 
large, the rest of the elements are relatively small, so strate¬ 
gies 2:1 and Z 2 are assigned significant probabilities for the KE 
as shown in Table |4] 

We also provide a simulation result to demonstrate the op¬ 
timality of the derived strategy. In this example, four different 


Table 3. Optimal Strategy Probabilities (Independent Case) 


Player 

21 

22 

23 

2122 

2123 

2223 

212223 

KF 

0.00 

0.00 

0.00 

0.00 

0.00 

0.40 

0.60 

Attacker 

0.14 

0.22 

0.00 

0.14 

0.14 

0.22 

0.24 


Table 4. Optimal Strategy Probabilities (Dependent Case) 


Player 

21 

22 

23 

2122 

2123 

2223 

212223 

KF 

0.16 

0.14 

0.00 

0.00 

0.00 

0.37 

0.33 

Attacker 

0.14 

0.02 

0.00 

0.00 

0.00 

0.34 

0.50 


scenarios are explored: 1) there is no attack; 2) the KE uses 
the optimal mixed strategy; 3) the KE uses a mixed strategy to 
pick each pure strategy with an equal probability 1/7; 4) the 
KE always chooses the first pure strategy. In Scenarios 2)- 
4), the attacker injects false information according to his/her 
optimal mixed strategy to the sensors at time k = 100. The 
resulting position estimation MSEs are plotted in Eig. □ It 
is clear that the optimal mixed strategy provides the best de¬ 
fense against the attacker, with the minimum increase in the 
MSE after the attack. 



Fig. 1. Optimal Mixed Strategy vs. Other Options 


7. CONCLUSION 

In this paper, we investigated the relationship between the 
Kalman filter and the adversary in a two-person zero-sum 
game. The Kalman filter (defender) tries to achieve more ac¬ 
curate system state estimation and avoid being attacked by 
the adversary. The adversary tries to mislead the Kalman fil¬ 
ter as much as possible. Both sides of the game will reach a 
Nash Equilibrium through the mixed strategies. Using min¬ 
imax techniques, we found the mixed strategy saddle point 
in the game. In the future, we will put more practical con¬ 
straints in our problem by letting both players in the game 
have limited information about the other player and introduce 
the detection mechanism to the Kalman filter system. 
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